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The continuous-time query model is a variant of the discrete query model in which queries can be 
interleaved with known operations (called "driving operations") continuously in time. Interesting 
algorithms have been discovered in this model, such as an algorithm for evaluating nand trees more 
efficiently than any classical algorithm. Subsequent work has shown that there also exists an efficient 
algorithm for nand trees in the discrete query model; however, there is no efficient conversion known 
for continuous-time query algorithms for arbitrary problems. 

We show that any quantum algorithm in the continuous-time query model whose total query 
time is T can be simulated by a quantum algorithm in the discrete query model that makes 
0(T log T/ log log T) C 0{T) queries. This is the first upper bound that is independent of the 
driving operations (i.e., it holds even if the norm of the driving Hamiltonian is very large). A corol- 
lary is that any lower bound of T queries for a problem in the discrete-time query model immediately 
carries over to a lower bound of ^^(T log log T/ log T) C ^{T) in the continuous-time query model. 



I. INTRODUCTION AND SUMMARY OF RESULTS 



In the query (a.k.a. black-box or oracle) model of computation, one is given a black box that computes the individual 
entries of an A/'-tuple, x = (xq, xi, . . . , xat-i), and the goal is to compute some function of these values, making as few 
queries to the black-box as possible. Many quantum algorithms can be naturally viewed as algorithms in this model, 
including Shor's factoring algorithm [1 , whose primary component computes the periodicity of a periodic sequence 
xo,Xi, . . . ,XAr-i (technically, the sequence must be also be distinct within each period). Other examples are ^^SjH]. 

In the quantum query model, a (full) quantum query is a unitary operation Qx such that 

Q.\j)\b) = \j)\b®xj), (1) 

for all j G {0, 1, . . . , N — 1} and b from the set of values that entries of the A^-tuple ranges over, and can be set to 
the bit-wise exclusive-or. Queries are interleaved with other quantum operations that "drive" the computation. The 
query cost of an algorithm is the number of queries that it makes. The efficiency of the other operations, besides 
queries, is also of interest. An algorithm is deemed efficient if it is efficient in both counts. 

When the tuple x consists of binary values, the form of a full query can be equivalently expressed as 

Q^\j)\b) = {-l)'-^\j)\b), (2) 

which is related to Qx from Eq. ([T]) via conjugation with a Hadamard transformation on the second register. For con- 
venience of notation, we can absorb the second qubit register b into the definition of x, by defining x' = (xq, . . . , ^2Ar-i) 
as XjQ = and x^^ = xj. Henceforth, we simply omit the parameter 6, and define a discrete query Qx as 

\j) = i-ir \j) , (3) 

for all j G {0, 1, . . . , N — 1}. (See [5] for more information about relationships between different forms of queries.) 

Farhi and Gutmann [6] introduced a continuous-time variant of the query model, where queries are performed 
continuously in time in the following sense. A query Hamiltonian Hx is defined as 

\j) = X, \j} , (4) 
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for j G {0, 1, . . . , A^ — 1}. Note that evolving under for time tt results in the full discrete query of Eq. (|3|. 
A quantum algorithm in the continuous-time query model is specified by: a driving Hamiltonian^ D{t)^ which is an 
arbitrary time-dependent Hamiltonian; an initial state |V^o); an execution time T > 0, and a measurement M of the 
final state. {D{t)^ I'^o): and M, are all functions of the input size N.) The input to the algorithm is embodied 
by a query Hamiltonian H^. In the execution of the algorithm, the initial state evolves under the Hamiltonian 
Hx + D{t) from time t = to time t = T. Measurement M of the resulting final state determines the output of the 
algorithm. 

The continuous-time query model has proven to be a useful conceptual framework for discovering new quantum 
algorithms [71 18] . Many algorithms in this setting can be converted to algorithms in the more conventional quantum 
query model [9l[T0]. However, it has not been previously shown that this can be done in general without incurring a 
significant loss in query complexity. 

The Suzuki- Trotter formula [TT] can be used to approximate a continuous-time algorithm by a sequence of full queries 
interleaved with unitary operations induced by D{t). This results in simulations of cost 0(exp(l/7^) T)^+^) for 
arbitrarily small > [T2l[T3] (the result was shown for the case of time-independent D{t)). Although this is "close 
to linear" in cases of interest where is bounded by a constant, if \\D\\ grows significantly as a function of the 
input size this approach fails to yield an efficient bound. Recent work by Childs [14^ gives a simulation of cost 
0(||D||T) that applies for all D{t) that are time-independent and with the additional property that their matrix 
entries are nonnegative. This raises the question of whether a "highly energetic" driving Hamiltonian can result in 
computational speedup over the discrete query model in some scenarios. The exponential cost in 1/r] for general 
driving Hamiltonians D{t) is also undesirable. 

For some problems — such as searching for a marked item or computing the parity of the input bits — it is already 
known that the continuous-time model provides no asymptotic reduction in query cost [6 (regardless of D(t)). Mo- 
chon [15] raised the question of whether this equivalency remains valid in general: most known lower bounds only 
apply to the number of full queries needed to solve a problem, leaving open the possibility that these lower bounds 
could be circumvented using continuous-time queries. We show that this cannot happen and essentially answer Mo- 
chon's question by showing that any algorithm in the continuous-time query model whose total query time is T can 
he simulated by an algorithm in the quantum query model that makes 0{T) queries. More specifically, we prove the 
following theorem: 

Theorem 1. Suppose we are given a continuous-time query algorithm with any driving Hamiltonian D{t) whose sup 
norm \\D{t)\\ is hounded above by any Li function with respect to t. (The size of \\D{t)\\ as a function of the input 
size N does not matter.) Then there exists a discrete-time query algorithm that makes 

( riog(rA) \ 

V£iogiog(T/£); ^ ^ 

full queries and whose answer has fidelity 1 — e with the output of the continuous-time algorithm. If the continuous- 
time query algorithm acts on a classical input (for instance, if it computes a function f{x) of the oracle x), the 
discrete-time query algorithm makes 

^ / Tlog(T/g)log(l/g) \ 



loglog(T/£) 



full queries. 



Note that this implies that any lower bound of T proven for the discrete query model automatically yields a lower 
bound of l](TloglogT/logT) C ^(T) for the continuous-time case. In addition, any algorithm in the discrete query 
model using T full queries can be easily simulated by a continuous-time algorithm running for time 0{T). This can 
be done with a driving Hamiltonian that rapidly swaps qubits to effectively turn on and off the query Hamiltonian. 
Thus, the two models (discrete and continuous queries) are equivalent up to a sub-logarithmic factor. 



A. Rough overview of the proof of Theorem [T] 

Here we provide a rough sketch of the proof of Theorem [l] a more detailed exposition is in Section [H] Starting with 
a continuous-time query algorithm, we apply the following sequence of transformations to it. 

A. Convert to a fractional query algorithm. Using a suitable Trotter- Suzuki type of approximation, the algo- 
rithm can be simulated by interleaved executions of D{t) and for small amounts of time. The approximation 
uses about p = \\D\\T'^ / e time slices, each of length T/p for precision fidelity 1 — e. This does not readily convert 
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into an efficient discrete-time query algorithm because the straightforward way of simulating each evolution 
uses full discrete-time queries, even though the time evolution is very small. The total discrete query cost would 
be 0{\\D\\T'^ /e) (and even the reduced exponent of T resulting from a high-order Suzuki formula would not 
affect the dependence on which could potentially be very large). 

B. Simulate fractional queries with low amplitude controlled discrete queries. We use a construction that 

permits each ij^^; smal l-time evolution to be simulated by a single con^ro//eG?-discrete query with control qubit 
in state ^ Y^T^^T72p|0) + i^/T/2p\l). This construction succeeds conditional on a subsequent measurement 
outcome. The success probability is approximately 1 — T/p. It is therefore very likely that roughly T of the 
p fractional queries will fail, and a procedure for correcting these post-selection failures is explained in step D 
below. 

C. Approximate segments of control qubits by low-Hamming- weight states. Part D will require us to divide 

the computation into segments, each involving m small-time evolutions of H^. For each segment, the collective 
state of the m control qubits is |(/)) ^ {y^l - T/2p |0) +z v'T/2p We can construct another m-qubit state 

1^') such that > 1 — e/T and such that is a superposition of basis states with Hamming weight 

only 0{m{T/p) log(T/£)/ log log(T/£:)). The Hamming weight of the control qubits is effectively the number of 
full queries performed. By rearranging the circuit, we make this association explicit, allowing us to truncate 
the circuit to deal with only the typical case, and thus reduce the total number of full queries needed for the 
segment to only 0{m{T/p) log(T/£:)/ loglog(T/£)). 

D. Correct the post-selection errors for each segment. Returning to the post-selection errors in the simula- 

tion of the fractional queries, they are corrected by dividing the computation into segments of sufficiently small 
size so that: (a) there are 0{T) segments to simulate; (b) the expected number of errors per segment is < 1/8. 
The post-selection results for each segment reveal exactly where any errors occurred, making it possible to 
"undo" the segments and then attempt to compute them again. This process is applied recursively, since new 
errors can arise during these steps. We show that this process only increases the expected number of segments 
simulated (including those that arise from corrections) by a constant factor. The result is 0{T) simulations of 
segments with an expected number of discrete queries 0(Tlog(T/£:)/ log log(T/£:)). Applying the Markov bound 
and standard amplification techniques leads to the query complexity in the statement of Theorem [l] 



Our manuscript is organized as follows. In Subsecs. [H A[ |HB[ |H C[ and |nD| we describe the simulati on o f a 



continuous-time T query algorithm by a discrete 0(TlogT/loglogT) query algorithm in detail. In Subsec. 
estimate the amount of full queries as a function of the output fidelity. Concluding remarks are in Section II 
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II. DISCRETE QUERY SIMULATION OF CONTINUOUS-TIME QUERY ALGORITHMS 

To obtain a discrete query simulation, we need a discretization of the continuous-time evolution performed by the 
algorithm. For this reason, we define a fractional query as the operation 

Qi\j)=e-''"-\j) = e-^<'-^\j), (7) 

for j G {0, 1, . . . , A/" — 1}, and its fractional cost is \0\/7t. We assume — tt < < tt. When 6> = tt, this is a full query, as 
defined by Eq. ([3|. A fractional query algorithm alternates driving unitaries and fractional queries, and its fractional 
query cost is the sum of the fractional costs of its queries. 

It is straightforward to approximate a continuous-time algorithm with continuous query time T, by a fractional 
query algorithm whose total fractional query cost is T/tt — but whose actual number of fractional queries p may be 
much larger than T. Since a fractional query can be easily simulated using two full queries (Fig.jl]), an algorithm that 
makes p ^ T fractional queries would be simulated using 2p full queries. This yields an undesirably large overhead 
for the discrete simulation of the original continuous-time algorithm. (The problem of simulating fractional powers of 
arbitrary unitary black-boxes is studied in [16 ). We introduce another method to approximate fractional queries by 
full queries with little loss in efficiency. What we show is that there is a way of organizing the structure of the driving 
and query operations so that many of the full queries may be omitted with only a small loss in accuracy. The overall 
procedure can be made to succeed with constant probability, and when it succeeds, the resulting state has very high 
fidelity to the state output by the original continuous-time algorithm. 
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FIG. 1: Simulation of the fractional query of Eq. ([7| using two full queries Qx controlled in the state |1) of an ancilliary qubit. 
The operation Rq, = exp(— i^^(]l — ax)/^) applies the desired phase to the state |— ) of the ancilla, with |ib) = [|0) di |1)]/a/2. 
The operator ax is the Pauli bit-flip operator. 



A. Converting a continuous-time query algorithm to a fractional query algorithm 

This subsection shows how to simulate a continuous-time query algorithm in terms of a fractional query algorithm 
that is efficient in terms of its fractional query cost. We construct this simulation through a straightforward application 
of a time-dependent version of the Trotter formula. For arbitrary precision e > 0, the Trotter- Suzuki approximation 
allows us to approximate a continuous-time T query algorithm using fractional queries, such that the fractional query 
cost is T/tt. The construction depends on the average norm (or average action) of the driving Hamiltonian, defined 



\Dmdt. 



(8) 



Here, || • || is the sup-norm defined as \\H\\ = sup|^^ 11^ IV^) 11/11 IV^) II- We assume that ||l^(t)|| is an Li function, so 
that r is well-defined. (Actually, since we only need r as an upper bound, it is sufficient that ||l^(t)|| is bounded above 
by an Li function.) Although the number of fractional queries grows proportionally to r, our simulation technique 
ultimately results in a number of discrete queries that is independent of the value of r. For fidelity 1 — ei, it is sufficient 
to decompose [0,T] into p > 2T^r/y^ subintervals of size T/p. The fractional query algorithm alternates between 
evolution under D{t) and evolution under Hx for time 6 = T/p. 

To handle the case of time-dependent Hamiltonians, we apply an extension, due to [17 , of the first-order Trotter 
product formula to time-dependent Hamiltonians. For any time-dependent Hamiltonian A{t), let UAih^ta) denote 
the unitary operation corresponding to Schrodinger evolution under A{t) from t = ta to t = ti) [18]. Then, by [17 , 



x-\-5 py 



\\Ua+b{x + S,x)-Ua{x + 5,x)Ub{x + S,x)\\< / \\[A{y),B{z)]\\dzdy. 

J X J X 

When Bit) is constant, with \\B\\ = 1, this simplifies to 

\\Ua+b{x + 5,x)- Ua{x + S, x)Ub{x + 5, x)\\ < 25 f^' \\A{y)\\ dy. 

J X 

In our algorithmic context, we replace A{t) with D{t) and B{t) with H^^ and we define the unitaries 

Vk = Uoiik + 1)^, kO) and Wk = Ud+h^ {{k + 1)^, kO). 



Then, by Eq. ([To]), for all A; G {0, 1, . . . , p - 1}, 

\\Wk-Vke-''''-\\<20 

from which it follows that 



ke 



\D 



dt , 



(9) 

(10) 
(11) 
(12) 



P-1 rik+1) 



\\D{t)\\dt 



k=0 

= 20Tr 



(13) 
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It follows that, if IV^i) is the final state of the continuous-time algorithm, and \ip2) is the final state of the approximating 
fractional query algorithm, and p = [2T^r/y£i] then 

\{H^2)\>Vl^i- (14) 

In Fig. |2]we show the si- approximation to the algorithm in the continuous-time query model. We assume that Vk 
and Wk act on a set of n qubits (where n > log(A^)), but extensions to larger-dimensional systems are possible. We 
refer to these n qubits as the system to distinguish them from additional qubits (ancillas) that will be introduced 
later. Although the total fractional query cost is T/tt, it should be noted that the total number of fractional queries 
is T'^T^J2| which may be much larger than T. (No assumption is made about the value of r.) For this reason, 
the full-query simulation obtained by replacing each fractional query by the circuit of Fig. [l] may yield an undesirable 
overhead. Our full construction will instead result in a discrete query cost that is 0(TlogT/ loglogT), independent 
of r and e\. 



v. 



2^1 



FIG. 2: Circuit approximation of a continuous-time T query algorithm. Each of the p fractional queries realize the evolution 
given by Eq. Q, with 6* = T/p G 0(^/(rT)). 



B. Simulating fractional queries with low amplitude controlled discrete queries 



This subsection shows how to replace every fractional query in Fig. [2] by the probabilistic simulation of Fig. [sj 
Without loss of generality, we may assume < 6> < tt. The idea is to add an ancillary (control) qubit initially in |0; 
and act on it by R\ as 



J?i|0) 



a/cos 6*72 |0) +i\/sin 61/2 |1) 



(15) 



with V = cos 6*72 + sin6'/2. The full query is then implemented controlled on the state |1) of the ancilla (i.e., a 
controlled-Qa; operation), and the ancilla is acted on by R2, given by 



R2 



^[vW^io) + yihr^ii)] , 

^[yW^IO)- yW^ll)] . 



(16) 



Finally, a projective measurement in the computational basis of the ancilla is performed. 

To show that the above algorithm implements a probabilistic simulation of Q^, we write, for all G (— 7r,7r], 



-id 12 



cos((9/2)]l + isin((9/2)Q^]. 



The ancilla-system state before the measurement is 

1 



|0) + v^e-^"/^ |1) Q-^l^ I 



(17) 



(18) 



The measurement in the ancilla projects the state of the system into (up to irrelevant global phase factors) \\\)) 
or Qx^^'^ IV^), with probabilities Ps = 1/'^^ and pf = 1 — Ps^ respectively. Since = T/p G 0{e/{rT))^ we obtain 
sinl9/2 < 6/2 = T/(2p), and thus Ps > I - T/p. 

For reasons described in Subsec. II D[ our final simulation also requires the probabilistic implementation of the 
(conjugated) fractional query ^ with similar success probability. We achieve this by replacing Ri with R'^ = cFzRi 



6 



|0> 















{ 


Ri 






R2 









|0) ® e''/^Qi 1^) 



>i-T/p, 



|1>' 



-i7v/4 



FIG. 3: Probabilistic simulation of the fractional query using a single discrete query Qx controlled on the state |1) of a 
control qubit. After the measurement, is performed with a success probability > 1 — T/p. Successful simulation occurs 
if the state of the control qubit is projected into |0). 



in the circuit of Fig. [3] The operator is the diagonal Pauli operator of the ancilla. In the event of failure (i.e., if 
the ancilla state is projected into |1) after measurement), the system state is acted on by the operation QS^^ up to 
irrelevant global phase factors. We usually refer to the undesired operations QZ^'^ and Qx^^'^ implemented in a failed 
simulation as errors. 



C. Approximating segments of control qubits by low Hamming weight states 



This subsection shows how the low-amplitude controlled discrete queries from the previous subsection can be 
efficiently approximated by full queries. Our construction makes sense for any contiguous segment consisting of m of 
the controlled discrete queries, as long as m < p and m G Q{l/0). For the purposes of the error-correcting procedure 
in the next subsection, we set m so that mO = 1/4 [20 . In Fig.|4]we show the first size-m segment (m is the number of 
full queries) appearing in the original circuit of Fig. |2] after each fractional query has been replaced by its probabilistic 



simulation, as explained in Subsec. II B 
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FIG. 4: The first size-m segment obtained by replacing every in Fig. |2| by the probabilistic simulation of Fig. [3| If 
m = [1/(4^) J, the total probability of success after measurement is bounded below by 3/4. Successful simulation occurs if 
every ancilla is projected into |0) after measurement. The operator U denotes the action induced by the operations inside the 
dashed box. The overall unitary action before the measurement is Rf^URf^. 



We begin by observing that, since the state of all m control qubits after the action of the operations Ri is 

= Hfm |o^®n. ^ _l_ (y^^lO) , (19) 

and m(l/'u) sin^/2 ^ 1/8, the amplitudes of this state are concentrated at the m-qubit states (in the computational 
basis) with small Hamming weight. This means that we can approximate |x) by a superposition of sufficiently low 
Hamming weight basis states. Intuitively, the Hamming weight of the control qubits corresponds to the number of 
queries actually performed, suggesting this is a step in the right direction. 

To make this approximation precise, let A(-) denote Hamming weight, and define 



ze{0,l}'^,A{z)<k 



(20) 
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the projector into basis states \z) of Hamming weight at most k. Also define 

Ix') = -^^^^=. (21) 

Then, since |(x1x)P is the sum of the absolute values squared of all amplitudes in |x) for basis states of Hamming 
weight up to /c, 

Kx'lx)!' = 1 - E (7) " ^)'""'^'' (22) 



j>k 



where 



cos^/2 + sin^/2 Sm \w?)' ^ ' 

For asymptotically large m (or T), this is essentially the probability that a Poisson distributed random variable with 
mean 1/8 is less than or equal to k, which is bounded below by 



1_ (1/^ 



Assuming that the number of segment computations performed is 0{T/e2)^ setting (xlx)^ ^ 1 — £2^3/^ is sufficient 
for the cumulative reduction in fidelity over all the segment computations to be below £3. To attain this, it is sufficient 
to set 



/ \og{Tle,e^) \ 
Vloglog(T/£2£3); 



(25) 



Although changing |x) to \x') suggests that the number of controlled full queries can be reduced to /c, the circuit in 
Fig. [4] must be rearranged to make this possible (as it is, it makes m queries regardless of k). The idea is to replace 
the controlled-queries interleaved with driving unitary operations in Fig. [4] with an equivalent circuit composed with 
fixed discrete queries interleaved with controlled driving unitaries. The form of the revised circuit is illustrated in 
Fig. [51 

The action of the gates Vq^. . . ^Vm is defined as follows. Let ih be the position of the h-ih. 1 in a state of the m 
control qubits, in the computational basis. (These states form a complete orthogonal basis allowing a case-by-case 
analysis for the equivalence.) The positions range from (top) to m — 1 (bottom). Then Vo is controlled by the state 
of the first m qubits and acts as follows: if zi = it does nothing. Otherwise it applies the sequence Vb, . . . , l^i^-i, 
unless zi is not well-defined (that is, if the state of the m control qubits is |0)*^^), in which case it applies all the 
unitaries Vb, Vi, . . . , Vm-i in the segment. For > 0, applies F^^, . . . , Vi^^^-i if ih and z^+i are well-defined; does 
nothing if ih is not well-defined; and applies F^^, . . . , Vm-i if just ih-\-i is not well-defined. It is easy to see that this 
circuit exactly simulates the one in Fig. It still makes m G 0{rT / ^Jel) full queries. 

Note that if the control qubits are in a state of Hamming weight smaller than h then V^, . . . , Vm do nothing and 
can be removed from the circuit. This follows from the above description; it can also be deduced by noting that the 
action of Vh depends on the Hamming weight of the state of the control qubits in the manner shown in Fig.pfb). Full 
queries square to the identity: = 11. For this reason, it is possible to truncate the last m — k queries of the circuit 
specified in Fig.jsja) without changing its effect on superpositions of basis states of Hamming weight bounded by /c; if 



m — k is odd, we need to change the control of the parity-controlled query at the end. For k chosen as in Eq. (25), the 



truncated circuit involves k-\-l G 0(log(T/£:2^3)/ loglog(T/£2s:3)) full queries (which is much less than m). It outputs 
a 0(6:2^3/^)— approximation to the state output by U in Fig. [4j We apply this truncation to all size-m segments in 
the circuit. The low query cost as a function of T from the truncation motivates the error correcting procedure that 
we now explain. 



D. Correcting erroneous fractional queries 

This subsection explains how to correct the erroneous Q^^^^ queries that occur when the measurements in Sub- 
sec. [HB] fail. 

As mentioned in Subsec. II C, the computation is divided into m segments so that md = 1/4. Note that there are 
4T such segments. Before the approximation of the previous Subsection is made, we have: (a) the probability that 
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FIG. 5: (a) Equivalent quantum circuit for the implementation of the circuit IJ within the dashed box in Fig.|4] The last query 
is controlled on the parity ^ of the state of the m control qubits, depending on whether vn is odd or even, (b) Description of 
the unitary t4- Each operation Vh-\^ • • • , Kn-i is controlled on the Hamming weight of the ancilliary state, enclosed by the 
corresponding boxes, being h. We extend the notation so that V-\ — \. 



a size-m segment is successfully completed is at least 3/4; (b) conditional on a segment completing unsuccessfully, 
the expected number of errors is upper bounded by 1/4. Property (b) was obtained by considering that the error 
probability per control qubit is bounded by ^ = T/p (Subsec. II B), and there are m = 1/(4^) control qubits per 
segment. 

We now describe an error correction procedure for each segment that succeeds with an expected number of extra 
segments that is bounded by a constant. Each of these extra segments will be ultimately simulated using the truncation 
explained in Subsec. II C| With this approximation, the expected number of queries is proportional to the cost of the 
original segment computation, namely 0(log(T/6:2^3)/ log log(T/£:2^3))- 

The following analysis is valid for the exact case, without invoking the approximation of Subsec. |II C[ Intuitively, 
the errors between the two approximations accumulate linearly, which is shown rigorously in the next section. 

First, note that, whenever erroneous fractional queries occur, it is known from the ancilla measurements in exactly 
what positions the resulting errors are. Since errors are unitary operations, it is then possible to undo the entire 
segment, and then redo it. At a high level, the undoing operation is implemented by simulating the fractional queries 
and the interleaved driving unitaries in reverse while inserting the Q^^^^ in place of QJ^ wherever an error has 
occurred (this aspect is described in more detail further below). The undo and the redo each succeed with probability 
at least 3/4, but they may each fail. If the undo or redo step fails, we iterate the recovery procedure. For instance, 
if the undo step fails, we must undo the failed undo step, and then redo the failed undo step. If these two actions 
succeed, we can continue with the original redo step from the recovery procedure. Success occurs when all the recovery 
steps are successfully implemented. Figure |6] illustrates the error correction process for an original segment. 

For the implementation of a segment, there are several types of segment-like computations related to it: the original 
segment^ segments corresponding to undo operations for sets of error positions, and the recursive versions of these 
(such as the undo operations related to each unsuccessful undo). We refer to each of these as segments. 

Our first observation is that the expected number of segments that are computed in order to correctly compute 
an original segment is at most 2. To see why this is so, note that the branching process for the error correction in 
Fig. [6] can be viewed as a classical random walk on a line, that moves one step to the right whenever a segment 
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FIG. 6: Branching process for the iterative error correcting step 
probabihstically as described in Subsec 



II B 



We let Cm denote a size-m segment of Fig.|2]to be simulated 
When any simulation fails (F) we attempt to correct it by undoing the failed 
circuit and redoing it with the right operations. The undoing and redoing circuits are size-m^ circuits, with m < m (they 
require operations Qt^). These are also implemented probabilistically with controlled full queries (Subsec. II B), yielding 
a success probability bounded below by 3/4. The dashed boxes and arrows are associated with the nodes and branches of the 
tree, respectively. Successful simulation (S) occurs when Cm is simulated successfully; that is, when the undoing and redoing 
circuits associated to all the visited nodes are simulated successfully (check marks). Variable j denotes the level of the branching 
process. 



computation succeeds (S) and one step to the left whenever a segment computation fails {F). (Upon failure, two 
segment computations are required: the failed segment computation must be undone and then redone.) The random 
walk starts in the state corresponding to the original segment and the goal is to advance one step to the right of this 
state. Since each of the segment computations succeeds with probability at least 3/4, this is a biased random walk 
with average speed (to the right) bounded below by 3/4 — 1/4 = 1/2. The expected number of steps, or segment 
computations, is then A < 2. 

Although the expected number of segment computations for each original segment Cm is bounded by a constant, 
we must take into account that not all segment computations have the same cost. The undo segments become more 
expensive as the number of errors being corrected increases. That is, for each erroneous fractional query, the undo 
segment has to correct all the Qt^^'^ operations obtai ned i n the failed simulation. In order to approximate the segment 



using the truncation procedure explained in Subsec. |II C[ we absorb each error Qt^^'^ into the adjacent Vi unitaries. 



Once we perform the transformation and truncation from Subsec. II C , a single error then gets multiplied into up 
to k operations Qt^^'^^ with k as in Eq. (25). Each of these operations is implemented using two full queries as 
shown in Fig. [l] replacing 0^ by ±7r/2. Therefore, with the approximation in mind, for each error obtained in a failed 
simulation it requires 0(log(T/£2^3)/ log log(T/£2^3)) additional full queries in the undo segment, as well as in each 
of the recursive undo and redo operations that occur if this segment fails. 

We return to the exact case. Let Co denote the expected number of operations Qt^^'^ needed to fix all the errors 
that occur in all segment computations of the branching process, starting from the original segment Cm- For each 
integer a > 0, let qa denote the probability that the initial computation of the original segment results in a errors 
(^0^3/4). As m gets large, approximates a Poisson distribution with mean bounded by mO = 1/4. Also, for each 

a > 1, let Cq, denote the expected number of operations Ot^^^ required to successfully undo the a errors after a failed 
computation of the original segment. Since these a errors will be part of every segment that is associated with this 
undo operation and the expected number of such segments is A, Cq, < Co + OiA\ the Co term denotes the expected 
number of new errors introduced during the undo operation. If we happen to have a > errors in a segment, the 
expected number of operations Ot^^^ we need to perform to correct it is then C^ for the undo step, plus Co for the 
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redo step. Considering all the possible outcomes of a segment simulation, we have 

m 

Co = qo'O^^qaiCa^Co) (26) 

m 

< ^qa{{Co^aA)^Co) (27) 

< 2Co l^fj Qc^j + A l^fj aq^^ (28) 

< (l/2)Co + 2(1/4) (29) 

(where we have used the fact that the expected number of errors is upper bounded by 1/4), which implies that Co < 1. 

In summary, using the error correction procedure, the expected number of segment computations and the expected 
number of operations Q^^^^ needed in the exact case are both 0{T). Using Markov's inequality, the probability of 
successfully terminating the error-correcting procedure can be lower bounded by 1 — £2 by running the procedure for 
0{T/e2) segment computations and operations Qt^^'^ . That is, we set a cutoff for the total number of queries used, 
and terminate the algorithm if this quota is exceeded. The probability of this happening is at most 82- Note that if we 
have a classical input to the algorithm, we can just set 82 equal to a constant. If the error correction procedure fails, 
we start over from the beginning; we can then attempt to run the algorithm 0(log(l/£2)) times, and the probability 
that one or more attempts succeed is at least 1 — 

If we use the truncation of Subsec. |II C| to approximate each segment computation, the expected number of full 
queries is 0(T log(T/£2^3)/ log log(T/£2^3)), and the probability that the error correction procedure terminates is at 
least 1 — £2 if we make 0(Tlog(T/6:2^3)/^2 loglog(T/£:2^3)) queries. In the next Subsection, we give a rigorous proof 
that the resulting final state of the computation has fidelity at least y^T^^^(£7^^^2^^^3) with the final state of the 
continuous-time algorithm, which completes the proof of Thm. [l] 



E. Rigorous analysis of fidelity 



From subsection II A, we have ('0i|'02) ^ ~ where is the output state of the continuous-time algorithm 
and |?/^2) is the output state of the fractional query algorithm. (Assume throughout this section that the overall phases 
of the various states have been adjusted to make all inner products positive.) 

Consider the algorithm with error correction as described in subsection |IID[ assuming that the control qubits are 
in the exact state |^^'^^(^/^2) (^^-[^^^ there is no approximation of this state by |xO^^^^^^^^)- This algorithm can be 
expressed via purification as a unitary operation whose input state includes the control qubit state l^^'^^^^/^^) 
whose output state is of the form 



1^3) = 713^ 1^2) |0) \go) + |1) \g^) , (30) 

where the second register tells us if the error correction procedure succeeded. Note that, if we define \ip2) = |'02) |0) |^o) 
then we have 



{^2m>vi^- (31) 

Finally, if we change the control qubit input state to the unitary operation from l^^'^^'^^/^^) |^/^(g)0(T/£2) g^j^^^ define 
the resulting output state as {i/j^) then 

(V'3|V'4) = {{X\x')r''^^^'''> (32) 

> VT^. (33) 

Combining these results yields 

ii'ili'A) > Vl- 3(^1+ £2 +£3), (34) 

where = IV'i) |0) I So)- This implies that the fidehty between and the corresponding portion oi \ip4} is 1 — s 
if £1 = £2 = £3 = £/9. 
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III. CONCLUDING REMARKS 



We have described an efficient discrete-query simulation of a continuous-time query algorithm. For total evolution 
time T and arbitrary precision, our algorithm requires 0(TlogT/ loglogT) full queries and 0(T^ log T) known uni- 
taries for its implementation, and its probability of success is bounded below by an arbitrary constant. We expect 
that the known operation complexity can be reduced; however we do not do so here. 

As a consequence, lower bounds on the (discrete) quantum query complexity for a function also are lower bounds on 
the continuous query complexity, possibly lower by a sub-logarithmic factor. We note that one can use this simulation 
to show that the bounded-error query complexity lower bound one would obtain by the polynomial method applied 
to the quantum query model is also a lower bound for the fractional or continuous query model, without the loss of 
a log T/ loglogT factor. It is conceivable, of course, that a continuous query algorithm might be able to achieve this 
lower bound when a discrete query algorithm cannot. 

One way to to show that the polynomial method does not lose a factor of log T/ loglogT is as follows. If instead 
of breaking the algorithm up into blocks of size m, we run the algorithm to completion without performing any error 
correction, the total number of full queries used (after truncation) is now just 0(T) rather than O (T log T/ loglogT). 
We can thus use the polynomial method to show that the final amplitudes (after truncation) are polynomials of 
degree in 0(T) in the variables xq, xi, ... xn-i- If, conditional on the final state of each control bit, we perform the 
mathematical operation that effectively fixes things, then this would yield the correct answer for all (low Hamming 
weight) values of the control bits, and doesn't increase the degree of the amplitude polynomials. Even though these 
aren't physical operations, they preserve the norm, and thus we get valid approximating polynomials with the same 
degree upper bounds. One way to fix things is to use variables Xij^ where Xij is the result of querying bit i on query 
number j. The mathematical correction can be to map j = xi if control bit j is 0, and to map j = 1 — if 
control bit j is 1. 

There remain a number of open questions about possible improvements. The total number of queries could possibly 
be reduced further. The minimum possible number of full queries is ^^(T), since a quantum query algorithm with T 
full queries can be simulated by continuous-time algorithm running for time 0(T). It might be possible to eliminate 
the factor logT/ loglogT from the query complexity of our simulation to obtain a tight equivalence between discrete 
and continuous query algorithms. In our simulation, this factor arises from the need to break up the algorithm into 
small segments, each of which can only have small error. However, without breaking up the algorithm in this way, 
too many errors accumulate due to failures of the probabilistic simulations for us to correct. Therefore, it appears 
that a new way of handling the error correction is needed if we wish to remove the extra factor of logT/ loglogT. 

Also, for a computation on an arbitrary initial quantum state (where fast amplification by trivially repeating the 
computation cannot be carried out) a failure probability bound of £ requires a factor of e"^ in the query complexity (by 
our approach, using the Markov bound). Since the branching process in Section II D becomes extinct exponentially 
fast in the generation, we conjecture that this scaling in e can be improved towards 0(log(l/£:)). Such an improvement 
may be useful in some settings. 

Finally, it is an interesting question to see if the number of driving unitary operations can be reduced to correspond 
to the cost of implementing the evolution of the driving Hamiltonian alone (in some reasonable sense). In the most 
general case, with a rapidly varying and strong driving Hamiltonian D{t)^ we do not expect a considerable reduction: a 
general D{t) corresponds to a complicated unitary circuit. However, for better behaved D(t)^ a reduction is expected. 
The case where D{t) is time-independent is particularly interesting, and could have relationships with improved 
Trotter-Suzuki formulas. In this case, all operations Vj in Fig. [2] are identical and the Ri gates from Fig. [i] and Vh 
gates from Fig. 5] can all be done using only 0(T polylog T) unitaries. Unfortunately, we do not know how to also 
reduce the number of R2 gates, so it remains an open problem whether the number of unitaries can be reduced to 
0(T polylog T) even in the case of a constant driving Hamiltonian. 
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